Corpus: hrv_news_2020_1M

3.6.2 Zipf's law for words of fixed lengths

Rank-frequency (Zipf) distributions for words of fixed length: 4, 6, 8, 10, 12, and 14 characters.


Zipf's diagram for words of fixed length


[Figure: Gnuplot diagram, not reproduced here]

Top Words of length 4
rank  frequency  word
1     106888     koji
2     76062      kako
3     59016      nije
4     48080      koje
5     47754      biti
Top Words of length 6
rank  frequency  word
1     36667      godine
2     22216      godina
3     18230      protiv
4     13286      kojima
5     12156      uvijek
Top Words of length 8
rank  frequency  word
1     14712      nekoliko
2     13280      milijuna
3     10264      Hrvatske
4     9892       Hrvatska
5     9741       županije
Top Words of length 10
rank  frequency  word
1     4602       posljednja
2     4317       vjerojatno
3     3519       pacijenata
4     3448       natjecanja
5     3263       aktivnosti
Top Words of length 12
rank  frequency  word
1     16782      koronavirusa
2     5151       predsjednika
3     2184       istraživanja
4     2056       konferenciji
5     1997       Ministarstva
Top Words of length 14
rank  frequency  word
1     2193       reprezentacije
2     1356       reprezentacija
3     1244       epidemioloških
4     1225       S druge strane
5     1151       gradonačelnika
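The per-length rankings above could be produced along the following lines. This is a minimal sketch, not the procedure actually used for this corpus: the function name top_words_by_length, the parameter names, and the toy token list are all hypothetical. It assumes the corpus is already tokenized and that length is counted in characters, spaces included, which would be consistent with the 14-character entry "S druge strane".

```python
from collections import Counter, defaultdict

def top_words_by_length(tokens, lengths=(4, 6, 8, 10, 12, 14), k=5):
    """Return {length: [(rank, frequency, word), ...]} with the k most
    frequent tokens of each requested length (hypothetical helper)."""
    counts = Counter(tokens)

    # Group (word, frequency) pairs by character length.
    by_length = defaultdict(list)
    for word, freq in counts.items():
        if len(word) in lengths:
            by_length[len(word)].append((word, freq))

    tables = {}
    for n in lengths:
        # Sort by descending frequency; ties are broken alphabetically.
        ranked = sorted(by_length[n], key=lambda wf: (-wf[1], wf[0]))
        tables[n] = [
            (rank, freq, word)
            for rank, (word, freq) in enumerate(ranked[:k], start=1)
        ]
    return tables

# Toy input for illustration only; not taken from the corpus.
tokens = ["koji", "kako", "koji", "godine", "koji", "kako", "godina", "godine"]
tables = top_words_by_length(tokens, lengths=(4, 6), k=2)
for n, rows in tables.items():
    print(f"Top words of length {n}")
    for rank, freq, word in rows:
        print(rank, freq, word)
```

Ranks are assigned separately within each length class, so rank 1 for length 14 is unrelated to the word's rank in the full vocabulary.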
Slope of the Zipf distribution by word length
length  slope
4       -1.1465449924719908
6       -0.7048905995358938
8       -0.644715586526212
10      -0.6299995359305186
12      -0.8070215235847792
14      -0.8410314934384804
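The page does not state how these slopes were obtained. One common way to estimate such a value is an ordinary least-squares fit of log(frequency) against log(rank), sketched below. The function zipf_slope and the synthetic input are hypothetical, and the fitting range, logarithm base, and any weighting used for the figures above are unknown, so this sketch is not expected to reproduce them.

```python
import math

def zipf_slope(frequencies):
    """Least-squares slope of log(frequency) versus log(rank).

    Hypothetical helper: expects at least two positive frequencies and
    ranks them in descending order, starting at rank 1.
    """
    freqs = sorted(frequencies, reverse=True)
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]

    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Standard closed-form OLS slope: covariance(x, y) / variance(x).
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

# Synthetic data following an exact power law f(r) = 1000 / r,
# for which the fitted slope should be very close to -1.
ideal = [1000 / r for r in range(1, 51)]
slope = zipf_slope(ideal)
print(slope)
```

A slope near -1 corresponds to the classical form of Zipf's law. Values closer to zero, as reported here for lengths 6 to 10, indicate that frequency falls off more slowly with rank within that length class.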
1347 msec needed at 2021-06-04 21:07